Automatic Discovery of Lexical Patterns using Pattern Extraction Algorithm to Identify Personal Name Aliases with Entities

نویسندگان

  • A. Muthusamy
  • A. Subramani
چکیده

The personal name aliases are extremely significant in information retrieval to retrieve complete information about a personal name from the web, as some of the web pages of the person may also be referred by his or her alias name / nick name / real name. There is a rapid growth in people searching where the personal name aliases are concerned. We proposed a pattern generator which includes automatic: lexical pattern extraction algorithm and attribute extraction algorithm. We exploit three data set of known Personal names (consisting of alias name, real name, and nick name), Profession and location names of a person as training semi-structured data set to efficiently extract lexical patterns. The extracted patterns are ranked according to F-Score. It conveys information related to alias names from contingency table returned by web search engine. The extracted lexical patterns (profession pattern and location name pattern) are often used to optimize candidate personal name aliases with attributes of a person availed in the contingency table, the non-frequent items are discarded from the contingency table. Next, we ranking the candidate alias in contingency table, Graph mining ranking algorithm with various similarity measures are used then to measure the strength of association between a name and a candidate alias, co-occurrence statistics are computed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatically Extracting Personal Name Aliases from the Web

An entity can be referred by multiple name aliases on the web. Extracting aliases of an entity is important for various tasks such as identification of relations among entities, automatic metadata extraction and entity disambiguation. To extract relations among entities properly, one must first identify those entities. Aliases of an entity are useful as metadata for that entity and can be used ...

متن کامل

Identification of Personal Name Aliases on the Web

Extracting aliases of an entity is important for various tasks such as identification of relations among entities, web search and entity disambiguation. To extract relations among entities properly, one must first identify those entities. We propose a novel approach to find aliases of a given name using automatically extracted lexical patterns. We exploit a set of known names and their aliases ...

متن کامل

Automatically Extracting Personal Name Aliases from the Web

Extracting aliases of an entity is important for various tasks such as identification of relations among entities, web search and entity disambiguation. To extract relations among entities properly, one must first identify those entities. We propose a novel approach to find aliases of a given name using automatically extracted lexical patterns. We exploit a set of known names and their aliases ...

متن کامل

Automatic Detection of Name Disambiguation and Extracting Aliases for the Personal Name

An individual can be referred by multiple name aliases on the web. Extracting aliases of a name is important in information retrieval, sentiment analysis and name disambiguation. We propose a novel approach to find aliases of a given name using automatically extracted lexical pattern based approach. We exploit set of known names and their aliases as training data and extract lexical patterns th...

متن کامل

A Survey of Automatic Extraction of Personal Name Alias from the Web

The survey paper explains about the extraction and retrieval of personal name alias using various techniques from the web with the help of web crawls. The existing methods help to improve the depth of knowledge relevant to alias extraction and retrieval process. It also describes about how the aliases are ranked, then page counts on the web, word co-occurrence using anchor text and techniques l...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015